AIBase

AI News


Google Releases Gemma 3 QAT Model: Runnable on a Single RTX 3090

Google has released a new version of its Gemma 3 series. Just a month after the initial launch, it published a Quantization-Aware Training (QAT) optimized version of Gemma 3 that aims to significantly reduce memory requirements while maintaining model quality. Specifically, the QAT-optimized Gemma 3 27B model cuts VRAM requirements from 54 GB to 14.1 GB, so users can now run it on a single NVIDIA RTX 3090.
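
The VRAM figures above line up with a simple weights-only estimate. The sketch below assumes 27 billion parameters, 16 bits per parameter for the original model, and 4 bits per parameter for the QAT version; the article does not state these precisions, so they are assumptions, and the helper name `weight_memory_gb` is hypothetical. The estimate ignores activations, the KV cache, and any tensors kept at higher precision, which is one plausible reason the reported 14.1 GB sits slightly above the 4-bit estimate.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 1e9


PARAMS_27B = 27e9          # assumed parameter count for Gemma 3 27B
RTX_3090_VRAM_GB = 24      # VRAM on a single NVIDIA RTX 3090

# Assumed precisions: 16-bit for the original, 4-bit for the QAT version.
original_gb = weight_memory_gb(PARAMS_27B, 16)
quantized_gb = weight_memory_gb(PARAMS_27B, 4)

print(f"16-bit weights: {original_gb:.1f} GB")
print(f"4-bit weights:  {quantized_gb:.1f} GB")
print(f"16-bit fits on one RTX 3090: {original_gb < RTX_3090_VRAM_GB}")
print(f"4-bit fits on one RTX 3090:  {quantized_gb < RTX_3090_VRAM_GB}")
```

Under these assumptions the 16-bit weights come to 54 GB, matching the article's starting figure, while the 4-bit weights come to 13.5 GB, which leaves headroom within the 24 GB of an RTX 3090.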

13k 21 hours ago

Models


Model                      Provider   Input tokens/M   Output tokens/M   Context Length (K tokens)
Pangu-NLP-N4-4K-3.2.36     Huawei     -                -                 4
Pangu-NLP-N4-32K-2.5.35    Huawei     -                -                 32
Pangu-NLP-N2-32K-3.1.35    Huawei     -                -                 32
Yi-9B-200K                 01-ai      -                -                 200
Yi-34B                     01-ai      -                -                 4

© 2026 AIBase